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1. Statement of the Problem 



The foUowing exercise is typical in introductory texts on deterministic finite au- 
tomata (DFAs) : "produce an automaton that recognizes the set of binary strings 
that, when interpreted as binary numbers, are divisible by fc." For example, exer- 
cise 1.30 in |2] asks the student to prove that the language {x | x is a binary number 
that is a multiple of k} is regular for each fc > 1; exphcitly presenting an automaton 
is the easiest solution. 

The traditional (and correct) answer constructs a fc-state automaton that keeps 
track not only of divisibiUty by fc, but also the current residue modulo fc. For 
example, if the input read was 1101, the machine would remember "13 mod fc" . The 
transitions between states are simple: if the automaton's current state is "r mod fc" , 
and the input symbol read is "0", it moves to state (2r) mod fc; if the input symbol 
read is "1", it moves to state (2r + 1) mod fc. 

(This example also generahzes to bases other than binary. Furthermore, even if 
thc input string is encoded in base b, the canonical DFA will still have fc states. It 
wiU, however, contain b transitions from each state.) 

The traditional answer, unfortunately, in general fails to produce a minimal 
DFA. This paper addresses the considerably more difhcult question of "how many 
states does a minimal DFA that recognizes the set of base-6 numbers divisible by 
fc have?" We denote this number by /h(fc) and derive a closed-form expression; in 
the proof, we also describe the states of the minimal DFA in more detail. 

The function fb{k) may be computed by algorithmic means. The author used two 
implementations of the Hopcroft minimization algorithm: an original Perl program 
and the highly-optimized AT&T FSM Package"'"'^ . According to experts in the 
field, no prior work addresses the general case of this problem except through such 
computational alleys. 



2. Interesting Patterns 

The function /fc(fc) exhibits very curious behavior. One interesting pattern con- 
sidcrs fb{k) with b fixed and k = x ■ for increasing values of z. 

Example. Table of fb{k) for 6 = 6 and fc = 2^. (That is, a; = 1, ?; = 2, and z 
ranges from to 10.) 



z 





1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


2^ 


1 


2 


4 


8 


16 


32 


64 


128 


256 


512 


1024 


/6(2^) 


1 


2 


3 


5 


8 


12 


20 


29 


45 


72 


104 


/6(2^+^) -/6(2^) 


1 


1 


2 


3 


4 


8 


9 


16 


27 


32 


64 



Date: September 28, 2003. 

1 



2 



BORIS ALEXEEV 



The successive differences of f&{2^) are the powers of 2 and 3, sorted in increasing 
order! 



Example. Tablc of Jb{k) for 6 = 2^ • 5 = 20 and fc = 30 • 5^ (That is, x = 30, 
y = 5, and z rangcs from to 6.) 
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/2o(30^5-) 
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/20 (30 •5^+^) -/20 (30 •5^) 
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Here, the successive differcnces of /20 (30 • 5^) come in increasing order from two 
sequences: {2 • 4™} = {2, 8, 32, 128, . . .} and {12 • 5™} = {12, 60, 300, . . .}. 



We observe that the function fb{k) manages to pick terms, in increasing order, 
from two unrelated sequences! At first, it is hard to imagine a formula that would 
produce such a function. Investigating this bizarre behavior was the starting point 
for this study. 

3. Main Result 

Theorem 1. Let l{x,y) = gcd{x y) • ^'^s'^ 

00 

fb{k) = Z(fc,6°°) + ^min{¿(6",fc),/(fc,6")-/(fc,6"+i)} 

Q = 

= nnnL(fc,6-^) + ¿U&",^)| 

" l a=0 J 

-4o-l 

= l{k,b-^°)+ J2 ^ib"^k), 

where Aq is the smallest nonnegative integer a satisfying l{k,b°') — l{k,b'^^^) < 
/(6",fc). 

Remarks. The function l{x,y) is not symmetric; indeed, l{x,y) = l{y,x) if and 
only 'ú X = y. 

We use the notation l{k, b°°) to denote l{k, b") for sufficiently large a; similarly, 
the infinite sum can be truncated when l{k,b°') — l{k,b°'^^) = 0. This equahty 
certainly holds for a > log^ fc. 

Lemma |ül shows that the three expressions in the theorem are equivalent. 

To understand the expressions of fb{k) in the theorem, we may draw a table 
hsting a, /(&",fc), l{k,b"), and l{k,b°') — l{k,b°'^^). The first and third expressions 
may be understood fairly simply as written. However, the second expression is 
more difficult; it states that fb{k) is the minimal sum one can obtain by summing 
zero of more elements of the form l{b°', k) (as a ranges from to .A — 1) and then 
the foUowing value of l{k,b°') (that is, a = „4). 

Example. 6 = 6, fc = 16 = 2"*: We can calculate /b(fc) with any of the expressions 
above (for the third, use Ao = 2). The minimal terms of the first expression appear 
underhned below; simultaneously, the minimal "path" 8 = 1 + 3 + 4 (in ternis of 
the second formula above) is indicated in boldface. Note that other paths such as 
15 = 1 + 3 + 9 + 2, 9 = 1 + 8, and 16 = 16 (the trivial ^aXh A = 0) yield non-minimal 
sums. 
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4. COROLLARIES TO THE MAIN ReSULT 

Corollary 2. The following are upper bounds for fb{k): 
fb{k) <k ^l{k,b'>) 

fb{k) <l + 



fb{k) <1 + 



gcd^fc, b) 



l{b°,k) + l{k,b>-) 
l{b",k) + l{b\k) + l{k,b^) 



gcd{b,k) gcd(fc,6^) 
Proof. These foUow immediately from the second expression in Theorem^ 



□ 



Corollary 3. The canonical DFA described in Section^is minimal if and only if 
gcd(fc, 6) = 1 or fc = 2. 

Proof. The canonical DFA has k states and hence we must determine when fb{k) = 
k. 

If gcd(fc,6) = 1 or fc = 2, the first expression of Thcorcm immediately gives 
fb{k) = k. Otherwise, we have ^^^^^ < fc — 1, and by the previous coroUary, 

k 



fb{k) < 1 + 



rcd(fc, b) 



< k. 



□ 



Corollary 4. The successive differences o//6(2^) are powers o/2 and Z, sorted in 
increasing order. 

Proof. Manipulation of the resuh of the theorem yields 

oo 

/6(2-) = ¿(2^6°°) + ^min{Z(6",2-),¿(2^6")-/(2^6«+i)} 

oo 

= l + ^min{3"- r2"--],[2"^"~ij} 

a=0 
2-1 

= l + ^min{3",2"-"-i}. 



It is not difñcult to see that as one increments z i-^ z + 1, a new term of the form 
min {3", 2^^"^^} is added, and the desired property holds. □ 

Remark. A similar approach may be applied to the general case of fb{x ■ y") for 
increasing values of z. In particular, we can easily prove the pattern we noticed in 
SectionHfor /2o(30- 5^). 

Corollary 5. Ifb = p" (p not necessarily prime, but see the remark) and k = p™-x 
with gcd{x,p) = 1, then fb{k) = x + [^]. 
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Proof. We use the first expression of the theorem: 

oo 

ftik) = /(fc,6°°) + ^mm{¿(&",/c),¿(fc,5")-¿(fc,5"+i)} 



a=0 

oo 



X + 

Q = 



As long as na < m, [p""-™] = 1 and [p™-""] • x > [p™-(«+i)"] . There are 
precisely [^] such a (since < a < ^), so we have 

Tf-l-l oo 

/,(fc) = x+ ^ {!}+ ^ {0} 

a=0 "=rvi 

"m" 



as desired. □ 

Remark. If p is prinie, and thus 6 is a prime power, this coroUary completely 
characterizes fb{k), as all k can be represented in the formp™-a; with gcd(a:,p) = 1. 

5. Proof of the Main Result 

Lemma 6. The three expressions of Theorem^are equivalent. 

Proof. By looking at the powers of a fixed prime, we see that fc) and gcd(fc, 5") 
are increasing (not necessarily strictly) functions of a. It is also easy to show that 
gcd(fc, 5"+^)/ gcd(fc, 5") is decreasing, which immediately imphes that ¿(fc,5") — 
Z(fc, 6"+^) is decreasing. Therefore, in the sum 



oo 

min 

a=0 



{lib^,k),l{k,b^)-l{k,b^+')}, 



one takes Ao elements from the first sequence {¿(6", fc)} and then infinitely many 
from the second sequence {l{k, b°') — l{k, 6"+^)}. Telescoping the latter, one gets the 
other two expressions of the theorem. (The cut-off is the smallest nonnegative 
integer a satisfying l{k, b") - l{k, 6"+^) < ¿(6", fc).) □ 

Proof of Theorem^ Constructing a DFA directly, as in Sectionn] is often difficult 
because one must describe the transitions between states in addition to the states 
themselves. We will use the MyhiU-Nerode Theorem and the accompanying theory 
of extension invariant equivalence relations to work with the states of the automaton 
only. 

Definition. Given a language (set of strings) L over an alphabet E, we define the 
extension invariant equivalence relation ~ l associated with L as foUows: strings x 
and y in S* are equivalent (s ~l ?/) if for any suffix z € "E* , xz G L if and only if 
yz £ L. (As is customary, E* denotes the set of aU finite strings over S. Later, we 
use E+ — T,* \ {e} to denote the set of nonempty strings over S.) 

The MyhiU-Nerode Theorem ^ Thms 3.9-10] estabhshes that the minimal-state 
automaton accepting L has, up to isomorphism, one state corresponding to each 
equivalence class of Therefore, the minimal-state automaton has exactly the 
number of states as the index of ~l. (In particular, a language L is regular if and 
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only ií has finite index.) In addition, any DFA recognizing L can be altered by 
identifying ("gluing") some states together to obtain the minimal-state automaton. 

In this proof, we let E be the set of base-6 digits and L the set of base-6 numbers 
divisible by k. In addition, since we work with onfy one language at a time, we 
may write x ^ y rather than x ~¿ y. 

To begin, we wiU restate the problem equivalently in a way that wiU aUow us 
to utiUze modular arithmetic. Because the canonical DFA accepting L has a state 
for each residue modulo fc, the MyhiU-Nerodc Theorem impUes that thc minimal- 
state DFA wiU contain states that correspond to groups of residues modulo k. 
Therefore, in the pursuing anafysis, rather than considering strings of digits, we 
discuss residues; in a way, we are projecting E* onto (in thc natural manner). 
For example, L now becomes very simple: instead of containing aU numbers divisible 
by fc, it contains the single residue (mod k). To complete the reduction, we need 
only bother ourselves with one further 

Definition. Let r G be a residue modulo fc and d e E a base-6 digit. We 
define the concatenation rd to be the residue b ■ r + d (mod fc). Similarfy, ií d = 
dn-i ■ ■ • dido G is a nonempty string of digits, let the concatenation rd be what 
is obtained by successively concatenating individual digits: 

rd = b ■ (b ■ {. . . {b ■ r + dn-i) ■ ■ ■) + di) + da) = b" ■ r + d^-i ■ ■ ■ did^ (mod fc), 

where d denotes d interpreted as an integer. Of course, if d = e, the empty string, 
rd = re = r. 

FinaUy, extend onto Zk'. residues x,y € are equivalent if for any string 
z £ "E* , xz = (mod fc) if and onfy ií yz = (mod fc). 

Now, suppose Ais a nonnegative integer. We wiU describe 



pre-equivalence classes, each a group of residues, which wiU be a refinement of the 
equivalence classes of '^l. 

The pre-equivalence classes we define naturaUy present themselves in packages, 
a term we borrow from computer programming to indicate coUections of classes. 
AUogether, there are A+1 distinct packages, which we number 0, . . . , in addition, 
we wiU sometimes refer to package A as the distinctive package etcetera. These 
packages come in the sizes anticipated from Q: if < a < package a contains 
¡{b",^) pre-equivalence classes, while package A contains l{k,b-^) pre-equivalence 
classes. 

We now define the packages. Suppose < qí < .4. Package a wiU consist of 
those residues r such that there exists a string d of length a such that rd = and 
no smaller a works; furthermore, these residues wiU be grouped according to their 
corresponding d's. MathematicaUy, for each < c < 5" such that gcd(6",fc)|c, 
package a contains the pre-equivalence class {x\b°' ■ x + c = 0}, except those x that 
appeared in package a — 1 or earher. (Note that the equation b" ■ x + c = Q has 
a solution x iff gcd(6", fc) | c.) Because there are precisely &"/gcd(6", fc) = Z(6", fc) 
such c in the desired range, these packages have the stated sizes. Before we proceed, 
note that the union of the pre-equivalence classes in packages through a consists 
of all residues x satisfying ■ a; -f c = with < c < 6", and no others. 



n 




a=0 
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Packagc ctcctcra consists of thc lcftovcrs; mathematically, it is similar, but there 
is no rcstriction on c: for each < c < fc (only to avoid duphcation modulo k), 
package A contains the pre-equivalence class {a; | &-^-a;-|-c = 0}, except those x that 
have appeared previously. Once again, we have the necessary number of classes, 
since k/gcd{k, 6^) = l{k, b-^). 

Example. 6 = 6, A; = 16 = 2*: the pre-equivalence classes for A = 2. This value 

of A was choscn so that thcsc groups corrcspond to thc statcs in thc minimal DFA. 
Strikeouts indicate that the given value of x satisfies b"' ■ x + c = but already 
appeared in a previous package. 



Packagc 


Packagc 1 


Package 2 {etcetera) 


c 


{x} 


c 


{x} 


c 


{x} 





{0} 





{#,8} 





{ 4, 12 } 




2 


{ 5, 13 } 


4 


{ 3, 7, 11, 15 } 




4 


{ 2, 10 } 


8 


{ ^T^, 6, 14 } 






12 


{ é^, 1, 9 } 



Recall once more from the statement of the theorem that Ao is the smallest 

nonnegativc integer a satisfying l{k,b°') — l{k,b"~^^) < l{b°',k). 
We makc three separatc claims: 

(1) for any A, our pre-equivalence classes coincide with the equivalence classes 
of with two possible exceptions: some pre-equivalence classes may be 
cmpty and somc prc-cquivalcncc classcs in packagc ctcetera may actually 
be equivalent (both of these would produce an overcount); 

(2) for A<Ao, all the pre-equivalence classes are nonempty; and 

(3) for A>Ao, the classes of package etcetera are actually inequivalent. 

It follows that for A = Aq, our pre-equivalence classes are precisely the MyhiU- 
Nerode equivalence classes of ~í:,. 

We begin by affirming (1): if two residues r and s are in the same class of 
packagc a, thcre cxists no string d of length less than a such that rd = or sd = 0. 
In addition, r ■ 6" = s ■ 6", so for any string d of length at least a, we have rd = sd. 
Therefore, r and s are equivalent, and the pre-equivalence classes are a refinement 
of those oí 

Moreover, if r and s are in different classes and at least one of r and s is not 
in package etcetera, then r ^ s. Indeed, if r and s are in different packages, the 
result is obviously true. If r and s are in different classcs of thc same package a 
with a < we can also conclude that r ^ s because r and s satisfy 6" • a; -|- c = 
for different values of c; therefore, there exists a string d (namely, the d such that 
cí = c) of length a such that rd = but sd^ 0. 

Before continuing, we note the significance of Aq . If q; < , thcn 

l{k,b'^-^)-l{k,b'^)>l{b'^-\k) ^ . I^í|^ < fc _ 6«-!, 
and if a > Aq, then 



l{k,b°'-\-l{k,b°') < l{b''-\k) 



MINIMAL DFAS FOR TESTING DIVISIBILITY 



7 



Equippcd, we proceed in order to (2). Suppose A < Ao] then, we claim that for 
any fixed < a < ,4 and c such that gcd(fc, 5") | c, there exists an x satisfying 

(t) 6" • x + c = 

which does not satisfy b"'^^ ■ x + c' = with < c' < b°'~^. Indeed, consider all 
X satisfying and note that these x are spaced apart equally with ggjjfc fac) sepa- 
ration between consecutive solutions. Multiplying these x by yields (possibly 
duphcate) residues b"^^ ■ x spaced k ■ ^^fj¡^^ 1,^^'' apart. But, because a < ^ < Ao, 

k-''''t'7?<k-b^~\ 

gcd(fc,fe") - 

whence there exists an x satisfying such that (6"~^ • x) mod k is in between 1 
and k — and such an x cannot satisfy ■ x + c' = with < c' < 6"~^. 

Therefore, aU of the classes of packages through A are nonempty. 

We finish with (3). Suppose A > Aq; it sufRces to show that if r ~ s and a is 
the minimal a such that b" ■ r = b" ■ s, then a < A. Assume the contrary: a > A. 
Then, r and s are both solutions of for a fixed c. To derive a contradiction, we 
again focus on the spacing of solutions of (^. So, consider all x satisfying (^; they 
are spaced gcj^fc ¡,0.-^ apart. As before, the residues 6"~^ • a; for 2; satisfying (^ are 

spaced k ■ ^gcd^fcj,c) ^ apart. However, because a > A> Aq, 

gcd(fc,6") 

and thus there is not enough room for two distinct (b"^^ ■ x) mod k in between 
1 and k — 6"^^. Therefore, either 6"~^ • r = 6"^^ • s or one of r and s satisfics 
b"^^ ■ X + c' = with < c' < 6"~^. The former contradicts the minimahty of 
a, and the second is impossible as well: without loss of generahty, r satisfies such 
an equation. But thcn, there exists a string d of length a — 1 such that rd = 0. 
Because r ^ s, it foUows that rd = sd = for a string of length a — 1, once 
again contradicting the minimahty of al We have reached a contradiction in aU 
cases, therefore our assumption was false and a < A. Therefore, any two residues 
are "distinguished" at or before a = A, and it foUows that any r and s in the 
package etcetera are equivalent if and only if they are in the same pre-equivalence 
class. 

At last, we are done. □ 
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For large-scale computations, when speed was crucial, AT&T Research's FSM 
Package was used to compute /b(fc), to complement the author's own programs. 
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